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|/5 (54) Title: CLONING, SEQUENCING AND EXPRESSION OF A COMAMONAS CYCLOPENTANONE 1,2-MONOOXYGE- 
Tf NASE-ENCODING GENE IN ESCHERICHIA COLI 



(57) Abstract: Cyclopentanone 1 ,2-monooxygenase (CPMO) from Comamonas (previously Pseudomonas) sp. strain NCIMB 9872 
2 carries out the second step of a degradation pathway that allows the bacterium to use cyclopentanol as a sole carbon source for 
growth. In the present invention there is reported the localization of the CPMO-encoding gene (cpnB) on a 4.3-kb Sphl fragment, the 
O determination of its sequence. The 550-amino acid CPMO polypeptide (Af r , 62, 1 1 1) encoded by the gene was found to have 36.5% 
identity with the sequence of cyclohexanone 1 ,2-monooxygenase (CHMO) of Acinetobacter sp. strain NCIMB 9871. The 62-kDa 
^ CPMO was expressed in E. coli as an IPTG-inducible protein. 
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CLONING. SEQUENCING AND EXPRESSION OF A COMAMONAS 
CYCLOPENTANONE 1 ,2-MONOOXYGEN ASE-ENCODING GENE IN 

ESCHERICHIA CPU 

TECHNICAL FIELD 

5 The present invention relates to an isolated DNA encoding a 

cyclopentanone monooxygenase (CPMO), or an enzymatically active portion 
thereof, and expression vector and a transformed cell containing the isolates 
DNA. 

BACKGROUND OF THE INVENTION 

l o Comamonas (previously Pseudomonas ) sp. NCIMB 9872 was one of the 

few microorganisms that have been characterized to produce a Baeyer-Villiger 
monooxygenase (BVMO; Griffin, M., et al., Biochem. J. 129:595-603, 1972; 
Griffin, M., et al., Eur. J. Biochem. 63:199-209, 1976; and Willetts, A., Trends in 
Biotech. 15:55-62, 1997; for a recent review). BVMOs are flavoproteins that 

15 mimic the classical Baeyer-Villiger organic chemical reaction which is a peracid- 
catalyzed oxidation of a ketone to an ester or lactone. The use of enzyme 
substitutes for the production of lactones in high yield and optical purity is an 
attractive feature in current trends of research and development toward 
replacing chemical methods with biological alternatives (Stinson, S.C., Chem. 

20 Eng. News, 83-104, 1998). To date, the best characterized BVMO enzyme is 
that of cyclohexanone monooxygenase (CHMO) produced by Acinetobacter sp. 
NCIMB 9871 (Stewart, J.D., Curr. Org. Chem. 2:195-216, 1998; Willetts, A., 
Trends in Biotech. 15:55-62, 1997). This is also the only BVMO whose gene 
has been cloned and sequenced (Chen, et al., J. Bacterioi. 170:781-789, 1988). 

25 Recently, this valuable resource was used to engineer a "designer yeast" in a 
whole-cell approach to effect a variety of asymmetric Baeyer-Villiger oxidations 
(Stewart, J.D., etal., J. Am. Chem. Soc. 120:3541-3548, 1998). 

It would be highly desirable to be provided with a new CPMO having an 

■ 

increased enzymatic activity for growing cells in a medium containing 
30 cyclopentanol or cyclopentanone as sole carbon source. 
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SUMMARY OF THE INVENTION 

One aim of the present invention is to provide a new CPMO having an 
increased enzymatic activity for growing cells in a medium containing 
cyclopentanol or cyclopentanone as. sole carbon source. 

■ 

5 In accordance with the present invention there is provided an isolated 

DNA encoding a cyclopentanone monooxygenase (CPMO), or an enzymatically 
active portion thereof, the isolated DNA being characterized by the ability to 
. hybridize specifically with the complement of the DNA represented in SEQ ID 
NO:8 under stringent hybridization conditions. 

io Also in accordance with the present invention, there is provided an 

isolated DNA, wherein it codes for a cyclopentanone monooxygenase (CPMO), 
and contains: 

(1 ) the nucleic acid sequence of SEQ ID NO:8; 

(2) a sequence corresponding to said nucleic acid sequence in 
15 the scope of the degeneration of the genetic code; or 

(3) a sequence hybridizing under stringent conditions with the 
sequence from (1) or (2), and still coding for cyclopentanone 
monooxygenase (CPMO). 

20 Still in accordance with the present invention, there is provided an 

isolated DNA encoding , a cyclopentanone monooxygenase (CPMO), or an 
enzymatically active portion thereof, said isolated DNA having SEQ ID NO:8. 

The present invention further provides an isolated DNA expression vector 
encoding an enzymatically active cyclopentanone monooxygenase (CPMO) 
25 comprising a DNA characterized by a sequence as set forth in SEQ ID NO:8, or 
a portion thereof, said portion encoding said CPMO, in expressible form. 

In accordance with the present invention, there is also provided a 
recombinant vector comprising the isolated DNA as decribed above, wherein 
the isolated DNA encodes cyclopentanone monooxygenase. 
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In a preferred embodiment of the present invention, the isolated DNA has 
a nucleic acid sequence of SEQ ID NO:8 or which, due to the degeneracy of the 
genetic code, is a functional equivalent thereof. 

Also in accordance with the present invention, there is provided a 

■ 

5 recombinant vector containing one or more copies of a recombinant DNA 
described above . 

The recombinant vector may be a prokaryotic vector. The recombinant 

■ 

vector may also be a plasmid. 

Therefore, in accordance with the present invention, there is also 
10 provided a biologically functional plasmid or viral DNA vector, which contains a 
DNA as described above. 

The present invention also provide a host cell comprising a recombinant 
vector as described above . 

Accordingly, there is also provided a cell transformed with a heterologous 
15 DNA expression construct encoding an enzymatically active cyclopentanone 
monooxygenase (CPMO) comprising a DNA characterized by a sequence as 
set forth in SEQ ID NO:8, or a portion thereof, said portion encoding said 
CPMO, in expressible form. 

The cell may be a prokaryotic cell or it may be E. coll 

20 Still in accordance with the present invention, there is also provided a 

purified cyclopentanone monooxygenase (CPMO) having : 

a) an amino acid sequence as set forth in SEQ ID NO:5; 

b) an amino acid sequence encoded by a nucleic acid sequence 
as set forth in SEQ ID NO:8; or 

25 c) an amino acid sequence encoded by a nucleic acid sequence 

hybridizing to a nucleic acid sequence complementary to the 
nucleic acid sequence of step b) above under stringent 
conditions, said amino acid sequence encoded in step c) 
having a same activity as the amino acid sequence in a). 
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The present invention also provides a recombinant cyclopentanone 
monooxygenase (CPMO) having an enzymatic activity superior to the one from 
a native Pseudomonas, and more preferably twice superior. 

5 The recombinant cyclopentanone monooxygenase (CPMO) may be 

prepared from Comamonas sp. NCIMB 9872. The recombinant cyclopentanone 
monooxygenase (CPMO) has preferably a sequence as set forth in SEQ ID 
NO:5. 

A method for growing cells in vitro in presence of cyclopentanol or 
10 cyclopentanone as sole source of carbon, said method comprising the steps of: 

i 

a) transforming a cell with the expression construct described 
above; and 

b) growing the cell of step a) under suitable conditions in a 
medium containing cyclopentanol or cyclopentanone as a sole 

1 5 source of carbon. 

To increase this gene potential, according to the invention it is reported 
herein the cloning of a cyclopentanone monooxygenase (CPMO)-encoding 
gene (cpnB) from Comamonas (Pseudomonas ) sp. NCIMB 9872, the 
determination of its DNA and surrounding sequence and expression of CPMO 
20 activity and protein in £. coll 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 illustrates the first two steps of cyclopentanol degradation by 
Pseudomonas sp. NCIMB 9872; 

Fig. 2 illustrates the genetic organization in Comamonas sp. NCIMB 
25 9872 in the Sph\ fragment containing cyclopentanone monooxygenase- 
encoding gene (cpnB) and additional open reading frames; 

Fig. 3 illustrates an alignment of the amino acid sequence of the CPMO 
of. Comamonas sp. NCIMB 9872 with that of CHMO from Acinetobacter sp. 
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NCIMB 9871 and a steroid monooxygenase (STMO) from Rhodococcus 
rhodochrous; 

Fig. 4 illustrates a SDS-PAGE of crude extracts from £. coli (pCMP201 ); 

and 

5 Fig. 5 illustrates CPMO-encoding gene, designated cpnB. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

Cloning of the Comamonas sp. NCIMB 9872 CPMO-encoding gene 

Pseudomonas sp. NCIMB 9872 (henceforth strain 9872) identified as a 
Comamonas by 16S rDNA sequencing in this study, was purchased from the 

10 National Collections of Industrial and Marine Bacteria Ltd (NCIMB, Aberdeen, 
Scotland) and grown at 30°C in Luria-Bertani (LB) broth (Sambrook, J., et al., 
Molecular cloning: a laboratory manual, 2nd ed. Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N. Y, 1989), or mineral salt medium (MSM), pH 7.0, 
containing 2 ml of cyclopentanone. The MSM recipe contains per liter: 1 .0 g of 

15 NH4NO3, 1.5 g of KH2PO4, 1.5 g of Na 2 HP0 4 , 0.2 g MgS0 4 -7H 2 0, 0.01 g of 
CaCI 2 '2H 2 0, 0.005 of FeSO«7H 2 0, 0.002 g of MnS0 4 '4H 2 0 and 0.1 g of yeast 
extract. Agar was added to 1.5% for plates. Genomic DNA of strain 9872 was 
prepared by the Marmur method (Marmur, J., J. Mol. Biol. 3: 208-218, 1961). At 
first, a Southern hybridization of DNA digested with BamHI was carried out 

20 using the Acinetobacter NCIMB 9871 CHMO-containing gene as probe. Since 
there was no positive result (hybridization conditions carried out at 65°C) the 
CPMO protein was purified in order to obtain an N-terminal amino acid 
sequence. The purification of CPMO protein from cyclopentanone-grown cells 
was according to Griffin and Turgill (Griffin, M., et-al., Eur. J. Biochem. 63:199- 

25 209, 1976). Using an automated protein sequencer (Perkin-Elmer model 477) a 
40-residue amino-terminal sequence of the purified CPMO was obtained (Fig. 

* 

2). This sequence, longer by 1 1 amino acids, is in perfect agreement with that 
reported previously from the same organism (Willetts, A., Trends in Biotech. 
15:55-62, 1997). Two degenerate oligodeoxynucleotide primers (5- 
30 ACIACIATGA CIACNATGAC-3' (SEQ ID NO:1) and 5'-ARRTGRTAIA 
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RYTGRTA-3' (SEQ ID NO:2), corresponding to amino acids 2-8 and 35-40, 
respectively) were synthesized to amplify a 116-bp product from total DNA 
prepared from strain 9872. The PCR amplification was performed in a Perkin 
Elmer-Model 2400 Thermal Cycler™ and the amplification conditions were 
5 94°C for 1 min, 50°C for 1 min and 72°C for 1 min for 30 cycles. The amplified 
product was cloned directly in the pXcmkn12 vector (Cha, J., et al., Gene 136, 
369-370, 1993), transformed in E. coli JM109 and the resulting plasmid was 
designated pCMP10. Before using the amplified product as a gene probe its 

i 

nucleotide sequence was confirmed. Nucleotide sequencing was determined by 
io the Taq DyeDeoxy terminator cycle sequencing kit and the ABI Prism 310 
Genetic Analyzer (Perkin Elmer). Plasmid isolation was performed by the 
method of Birnboim and Doly (Birnboim, H. C, and J. Doly, DNA. Nucleic Acids 
Res. 7:1513-1523, 1979). 

In Fig. 2, Orfl is most likely a transcriptional activator of the NtrC-type 
15 (Morett, E., L. Segovia, J. Bacterid. 175:6067-6074, 1993). The amino acid 
sequence of ORF1 (C-terminal 391 -amino acids) showed 38-40% identity to 
equivalent regions of proteins such as NTRC_ECOLI (Nitrogen regulation 
protein NR(I) from £. coli; Miranda, et al., The complete nucleotide sequence of 
the glnALG operon ofEschericha coliK12 15:2757-2770, 1987), ACOR_ALCEU 
o (Acetoin catabolism regulatory protein from Ralstonia eutropha; Kruger, N., et 
al., J. Bacterid. 179:4391-4400, 1992). The amino acid sequence of ORF2, 
showing similarity to enzymes of the short-chain alcohol dehydrogenase family 
(Jornvall, H., et al., Biochemistry 34: 6003-6013, 1995), is most homologous 
(45-46% identity) to a putative oxidoreductase CY39.16C of Mycobacterium 
5 tuberculosis (Swiss Prot sp:Q10855) and fadG3 of M. tuberculosis (GenBank 
accession number Z74025). For Pseudomonas sp. strain HI-201 the /acZ-Km r 
cassette from pKOK6.1 (Kokotek, W., et al., Gene 84: 467-471, 1989) was 
inserted into cpnB at the Ate/I site. 

In Fig. 2, the following terms are defined as follows: t w , transcriptional 
o termination sequence of phage fd; Km r , kanamycin resistance gene, lacZ, gene 
encoding b-galactosidase. Genes and markers are indicated with arrows. 
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To clone the CPMO-containing gene, the DNA insert from pCMPIO was 
amplified, labeled by the digoxigenin-11-UTP system according to 
manufacturer's instructions (Boehringer Mannheim GmbH) and used to probe a 
Southern hybridization of strain 9872 genomic DNA digested with various 
5 restriction enzymes (SamHI, EcoRI, H/ndlll, Kpn\, Nhe\, Psti, Sail, Sph\ and 
Xbal). As a result, a single hybridizing band of ca 4.3-kb Sph\ fragment was 
obtained. Conditions of hybridization were as before. Subsequently, a purified 
4.0- to 4.5-kb size fraction of Sphl-cut total DNA separated on a 0.8 % agarose 
gel was ligated to E. coli plasmid pUC18, which had been linearized and 
10 dephosphorylated. A clone containing the 4.3-kb insert was screened by colony 
hybridization using the PCR product as a probe; this recombinant plasmid was 
designated pCMP200. 

♦ 

DNA sequence of the CPMO-encoding gene (cpnB) and the flanking region 

Nucleotide sequencing of the CPMO-encoding gene was initiated by 

15 using a primer designed from the sequence of the PCR product cloned in 
pCMPIO and further extended using oligonucleotides derived from the new 
sequence. Both DNA strands of the Sph\ fragment were sequenced and found 
to consist of 4281 base pairs (bp). The sequence was analyzed by GENETYX- 
Mac (Software Development Co., Ltd. Chiba, Japan) and the BLAST program 

20 (Altschul, S. F., et al., Nucleic Acids Res. 25:3389-3402, 1997). As a result 
three open reading frames (ORFs) arranged in the same direction were 
predicted (Fig. 2). The nucleotide sequence of the 1650-bp ORF encoding 
CPMO is preceded by a partial ORF1 (1 173-bp) coding for the C-terminus of an 
NtrC-type transcriptional activator (Miranda, et al., The complete nucleotide 

25 sequence of the glnALG operon of Eschericha coli K12 15:2757-2770, 1987) 
and by a complete ORF2 (750-bp) coding for a homolog of the short-chain 
dehydrogenases/reductases (Jornvall, H., et al., Biochemistry 34: 6003-6013, 
1995). The two intergenic regions are 244-bp and 32-bp, respectively. The 
CPMO-encoding gene is referred to cpnB (cyclopentanone and B designates 

30 the second step of the degradation pathway, see Fig. 1) hereafter. In Fig. 5, the 
CPMO-encoding gene starts at nucleotide position 1822 and ends 3471 that, 
does not include the stop codon. Accordingly, the boundary of cpnA is 3507- 
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4256. The partial open reading frame preceding cpnB is from 1-1 174. 

Fig. 1 has been adapted from Griffin, M., et al. (Griffin, M., et al M 
Biochem. J. 129:595-603, 1972). The designated genes are: cpnA encoding 
cyclopentanol dehydrogenase; cpnB encoding cyclopentanone 1,2- 
5 monooxygenase (CPMO). An alternative name for 5-vaIerolactone is 5- 
pentanolide. Subsequent reaction steps are the formation of 5-hydroxyvalerate, 
5-oxovalerate, glutarate and finally acetyl Co A. 

The amino acid sequence of the. CPMO enzyme consists of 550 residues 
(Fig. 3). This sequence shows 36.5% identity and an additional 13.6% amino 

10 acid similarity with the 543-residue CHMO of Acinetobacter sp. strain NCIMB 
9871. An equally related protein (549 amino acids; 37.3% identity and 12.4% 
similarity) is the putative steroid monooxygenase (STMO) of Rhodococcus 
rhodochrous (Morii, S., et al., GenBank accession number AB010439, 1998). 
The latter enzyme carries out the oxidation of progesterone to produce 

15 testosterone acetate. A CLUSTAL alignment of these three sequences gave 
24.6% positional identity (Fig. 3). 

i 

t 

In Fig. 3, asterisks indicate identical amino acids, dots indicated similar 
amino acids and dashes indicate gaps introduced to maximize the alignment 
The amino-terminal peptide sequence confirmed by Edman degradation is 

■ 

20 underlined. The locations of the consensus FAD fingerprint sequences as 
described by Eppink et al. (Eppink, et al., Prot ScL 6:2454-2458, 1997) are as 
indicated. The conserved GD motif found in flavoprotein hydroxylases as a 
second FAD fingerprint is also indicated. Not shown is the DG motif of 
flavoprotein hydroxylases which has the sequence of chhhssDGxcSxhR. Lower 

25 case letters identify certain residues types: h, hydrophobic residues, s, small 
residues, c, charged residues, and x, any residues. Note that a DG doublet is 
present in CPMO and STMO sequence. 

A notable sequence motif present in CPMO and related proteins is the 
FAD-binding fingerprint (GXGXXG) that is similar to those found in flavoprotein 
3 0 hydroxylases (Eppink, et al., Prot Sci. 6:2454-2458, 1997). Flavoprotein 
hydroxylases (eg. phenol hydroxylase, the structure is now known; Enroth, C, 
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■ 

et al., Structure 6:605-617, 1998) are monooxygenases that catalyze the 
insertion of one atom of molecular oxygen into the substrate using NAD(P)H as 
electron donor. These proteins possess a conserved "Asp-Gly (DG)" motif for 
both FAD and NAD(P)H binding in between two fingerprint motifs for the FAD 

5 binding (fingerprint 1: GXGXXG; fingerprint 2: Gly-Asp [GD] motif). Sequence 
motifs in CPMO, STMO and CHMO differ from those in flavoprotein 
hydroxylases by having a repeated GXGXXG motif (amino acids 24 to 33 and 
1 93 to 202 in CPMO numbering). The possibility that the second FAD fingerprint 
in CPMO and related proteins fulfils a dual role of FAD and NADPH binding 

o awaits structural determination of a representative member of this family of 
proteins. It is reasonable to assume that a different mechanism in catalysis is 
reflected in the motifs seen in the two classes of proteins. 

< 

Expression of cpnB gene in £, coli 

Two primers of the following sequence were synthesized to amplify the 

5 cpnB gene and the resultant 1 .7-kb DNA fragment was cloned in the pSD80 
plasmid to yield pCMP201. Plasmid pSD80 is a third generation derivative of the 
commercially available pKK223-3 vector (Pharmacia) that contains a tac 
promoter upstream of the multiple cloning site (MCS), an unc terminator 
sequence downstream of the MCS, and lacl q elsewhere on the plasmid (Smith, 

o S.P., et al., Biochemistry 35:8805-8814, 1996). The primers were: 5'- 
AA AAGGCCT G AACTTCAATT ATTTAGGAGA C-3' (SEQ ID NO:3) and 5'- 
AAA ACTGCAG GAGTTGCACA ACAGAGTCTT AG-3' with built-in Stu\ and Psfl 
restriction sites (underlined), respectively, to facilitate cloning at the compatible 
sites (Smal and Psfl) of the pSD80 vector. Vent DNA polymerase (New England 

5 BioLabs, Beverly, MA) was used and the amplification conditions were 94°C for 
1 min, 50°C for 1 min, and 72°C for 1 min, for 30 cycles. The amplified DNA 
fragment was purified from an agarose gel and digested with Stu\ and Psfl. One 
of the resulting recombinant plasmids was designated pCMP201 . By DNA 
sequencing it was established that no mutation had been introduced in the cpnB 

o gene during PCR amplification. 
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Fig. 4 shows the production of a 60-kDa protein in a Coomassie blue- 
stained SDS-polyacryiamide gel of the crude protein extract prepared from E. 
. coii JM109 (pCMP201) cells that were induced by 0.1 mM isopropyl-beta-D- 
thiogalactopyranoside (IPTG). The cells were induced at an absorbance (A600 
5 nm) of 0.4 to 0.5 and the induction period was up to 4 hr. The observed 
molecular mass was in agreement with the predicted size of the 62-kDa CPMO. 
In the absence of IPTG, this protein band was not produced. Also, the CPMO 
enzyme activity was observed only in those cells grown in the presence of 
IPTG. CPMO activity was assayed at 25°C by measuring a decrease in 

10 absorbance at 340 nm in 50 mM phosphate buffer (pH 7.8) containing 1 umol of 
cyclopentanone, 0.2 umol of NADPH, and the crude enzyme extract prepared 
from E. coii JM109 (pCMP201). These cells were cultivated in 100 ml of LB 
medium containing 100 ug/ml of ampicillin at 25°C. The IPTG-induced cells 
were harvested by centrifugation, washed in 50 mM phosphate buffer (pH 7.2), 

15 resuspended in 1/20 volume of same buffer, and sonicated by four-20 sec 
bursts with a Braun-Sonifier™ 250 apparatus. After centrifugation for 30 min at 
18,000 x g and at 4°C, the supernatant was used for determination of enzyme 
activity. One unit (U) of activity is defined as the amount of enzyme required to 
convert 1 umol of substrate in 1 min. Protein concentration was determined by 

2 0 the method of Bradford (Bradford, M. M„ Anal. Biochem. 72: 248-254, 1 976). As 
a result the specific activity of the CPMO enzyme was found to be 0.28 U/mg. 
The specific activity of CPMO in the native Pseudomonas was reported to be 
0.11 U/mg (Griffin, M., et al., Biochem. J. 129:595-603, 1972). 

V 

In Fig. 4, lane 1 has been loaded with extracts of IPTG-induced E. coii 
25 and lane 2 has been loaded with extracts of E. coii in absence of IPTG. M 
means molecular weight markers as indicated in kilo daltons. The arrow 
indicates the production of the desired 60-kDa protein. 

Inactivation of cpnB gene 

Pseudomonas sp. strain HI-201 was constructed by chromosomal 
30 inactivation of the cpnB gene using a /acZ-Km r cassette from the mobilizable 
pKOK6.1 vector (Kokotek, W., et al., Gene 84: 467-471, 1989). In pKOK6.1 the 
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lacZ gene is promoterless and in addition to Km r it is ampicillin resistant (Ap r ). 
The /acZ-Km r cassette was excised as a Psfl-fragment and inserted into the A/s/1 
site within the cpnB gene in pCMP200, yielding pCMP202. Electroporation of 
this plasmid into 9872 cells was carried out in the Gene Pulser™ (BioRads) and 
5 the parameters of electroporation were 2.5 kV, 25 uF and 200 ohm. The cells 
were initially washed with 1mM HEPES buffer and resuspended in 1mM HEPES 
containing 10% glycerol. Km r colonies were selected on LB plates containing 
Km (250 ug/ml). To select for double crossover mutants, a second screening on 
LB plates containing Ap (300 ug /ml) was carried out. The inactivation of cpnB 
10 (Fig. 2), was confirmed by PCR. The resulting mutant HI-201 was found not to 
be able to grow on cyclopentanol or cyclopentanone as a sole carbon and 
energy source. This result indicated that cpnB is essential for the degradaton of 
cyclopentanol and it appeared that there was only one copy of the cpnB gene in 
strain 9872. 

15 As expected of a flavoprotein the amino acid sequence of CPMO 

contains motifs of FAD fingerprints similar to those found in flavoprotein 
hydroxylases. 

Nucleotide sequence accession number 

The DNA sequence of the 4,281 -bp Sph\ fragment has been submitted to 
20 DDBJ and assigned accession number AB022102. The release of this data 
awaits the inventors' authorization. 

While the invention has been described in connection with specific 
embodiments thereof, it will be understood that it is capable of further 
modifications and this application is intended to cover any variations, uses, or 
25 adaptations of the invention following, in general, the principles of the invention 
and including such departures from the present disclosure as come within 
known or customary practice within the art to which the invention pertains and 
as may be applied to the essential features hereinbefore set forth, and as 
follows in the scope of the appended claims. 
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WHAT IS CLAIMED IS: 

1 . An isolated DNA encoding a cyclopentanone monooxygenase (CPMO), 
or an enzymatically active portion thereof, the isolated DNA being characterized 
by the ability to hybridize specifically with the complement of the DNA 
represented in SEQ ID NO:8 under stringent hybridization conditions. 

2. An isolated DNA, wherein it codes for a cyclopentanone monooxygenase 
(CPMO), and contains: 

(a) the nucleic acid sequence of SEQ ID NO:8; 

(b) a sequence corresponding to said nucleic acid sequence in 
the scope of the degeneration of the genetic code; or 

(c) a sequence hybridizing under stringent conditions with the 
sequence from (1 ) or (2), and still coding for cyclopentanone 
monooxygenase (CPMO). 

3. An isolated DNA encoding a cyclopentanone monooxygenase (CPMO), 
or an enzymatically active portion thereof, said isolated DNA having SEQ ID 
NO:8. 

4. An isolated DNA expression vector encoding an enzymatically active 
cyclopentanone monooxygenase (CPMO) comprising a DNA characterized by a 
sequence as set forth in SEQ ID NO:8, or a portion thereof, said portion 
encoding said CPMO, in expressible form. 

5. A recombinant vector comprising the isolated DNA of any one of claims 1 
to 3, wherein the isolated DNA encodes cyclopentanone monooxygenase. 

6. The recombinant vector of claim 5, wherein the isolated DNA has a 
nucleic acid sequence of SEQ ID NO:8 or which, due to the degeneracy of the 
genetic code, is a functional equivalent thereof. 

7. A recombinant vector, wherein it contains one or more copies of a 
recombinant DNA according to claim 1 , 2 or 3. 

8. A recombinant vector according to claim 4, 5, 6 or 7, wherein it is a 
prokaryotic vector. 
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9. A recombinant vector according to claim 4, 5, 6 or 7, wherein it is a 
plasmid. 

10. Biologically functional plasmid or viral DNA vector, which contains a DNA 
as defined in claim 1 , 2 or 3. 

* 

11. A host cell comprising a recombinant vector of any one of claims 4 to 9. 

12. A cell transformed with a heterologous DNA expression construct 
encoding an enzymatically active cyclopentanone monooxygenase (CPMO) 
comprising a DNA characterized by a sequence as set forth in SEQ ID NO:8, or 
a portion thereof, said portion encoding said CPMO, in expressible form. 

13. A cell as defined in claim 12, which is a prokaryotic cell. 

14. A cell as defined in claim 12, which is E. coli. 

* 

15. A purified cyclopentanone monooxygenase (CPMO) having : 

(a) an amino acid sequence as set forth in SEQ ID NO:5; 

(b) an amino acid sequence encoded by a nucleic acid sequence 
as set forth in SEQ ID NO:8; or 

(c) an amino acid sequence encoded by a nucleic acid sequence 
hybridizing to a nucleic acid sequence complementary to the. 
nucleic acid sequence of step b) above under stringent 
conditions, said amino acid sequence encoded in step c) 
having a same activity as the amino acid sequence in a). 

16. A recombinant cyclopentanone monooxygenase (CPMO) having an 
enzymatic activity superior to the one naturally occurring. 

17. The recombinant cyclopentanone monooxygenase (CPMO) of claim 16, 
wherein said CPMO is prepared from Comamonas sp. NCIMB 9872. 

18. The recombinant cyclopentanone monooxygenase (CPMO) of claim 16, 
wherein said CPMO has a sequence as set forth in SEQ ID NO:5. 
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19. The recombinant cyclopentanone monooxygenase (CPMO) of claim 16, 
wherein said CPMO has an enzymatic activity twice superior to that of a CPMO 
from a native Pseudomonas. 

20. A method for growing cells in vitro in presence of cyclopentanol or 
cyclopentanone as sole source of carbon, said method comprising the steps of: 

(a) transforming a cell with the expression construct of claim 4, 5, 
6, 7, 8 or 9; and 

(b) growing the cell of step a) under suitable conditions in a 
medium containing cyclopentanol or cyclopentanone as a sole 
source of carbon. 
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GxGxxG 

CHMO 1: MSQKMDFDAIVIGGGFGGLYAVKKLRDELELKVQAFDKATDVAGT 45 

STMO 1 :MNGQHPRSVVTAPDATTGTTSYDVVVVGAGIAGLYAIHRFRSQ-GLTVRAFEAASGVGGV 59 

CPMO ltM TTMFTMTTEQLGMNNSVNDKLDVLLIGAGFTGLYQLYHLR KL-GYKVHLVDAGADIGGI 59 

* * * *** * * * 

CHMO 46: WYWNRYPGALTDTETHLYCYSWDKELLQSLEIKKKYVQGPDVRKYLQQVAEKHDLKKSYQ 105 
STMO 60:WYWNRYPGARCDVESIDYSYSFSPELEQEWNWSEKYATQPEILAYLEHVADRFDLRRDIR 119 
CPMO 60:WHWNCYPGARVDTHCQIYQYSI-PELWQEFNWKELFPNWAQMREYFHFADKKLDLSKDIS 118 

*** **** * ****** t m _ .. *. . ** . 

CHMO 106: FNTAVQSAHYNEADALWEVTTEYGDKYTARFLITALGLLSAPNLPNIKGINQFKGELHHT 165. 

STMO 120: FDTRVTSAVLDEEGLRWTVRTDRGDEVSARFLVVAAGPLSNANTPAFDGLDRFTGDIVHT 179 

CPMO 119: FNTRVQSAVFDEGTREWTVRSIGHQPIQARFVIANLGFGASPSTPNVDGIETFKGQWYHT 178 

***** * * * *** * ****** 

GxGxxG 

CHMO 1 66 : SRWPDD-VSFEGKRVGVIGTGSTGVQVITAVAPLAKHLTVFQRSAQYSVPIGNDPLSEED 224 

STMO 180: ARWPHDGVDFTGKRVGVIGTGSSGIQSIPIIAEQAEQLFVFQRSANYSIPAGNVPLDDAT 239 

CPMO 179:ALWPQEGVNMAGKRVAIIGTGSSGVQVAQEAALDAKQVTVYQRTPNLALPMHQKQLSAED 238 

** * **** ***** ** ** *** *. * 

, • ( ••••••••• 

CHMO 225 : VKKIKDNYDKSLGWCMNSALAFALNESTVPAMSVSAEERKAVFEKAWQTGGGFRFMFETF 284 
STMO 240: RAEQKANYAERRRLSRESGGGSPHRPHPKSALEVSEEERRAVYEERWKLGG — VLFSKAF 297 

CPMO 239 : NLRMKPELPAAFERRGKCFAGFDFDFIAKNATELSAAERTEILEELWNAGG-FRYWLANF 297 

* ********* * 

. • ... 
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CHMO 285:GDIATNMEANIEAQNFIKGKIAEIVKDPAIAQKLMPQD LYAKRPLCDSGYYNTFNR 340 

STMO 2 98 . PDQLTDPAANDTARAFWEEKIRAVVDDPAVAELLTPKDH — AIGAKRIVTDSGYYETYNR 355 . 

CPMO 298 :QDYLFDDKANDYVYEFWRDKVRARIKDPKVAEKLAPMKKPHPYGAKRPSLEQWYYEIFNQ 357 

* ** * * ***** *** ** * 

CHMO 341 . DNVRLEDVKANPIVEITENGVKLENGDFVELDMLICATGFDAVDGNYVRMDIQGKNGLAM 400 

STMO 356 :DNVELVDLRSTPIVGMDETGIVT-TGAHYDLDMIVLATGFDAMTGSLDKLEIVGRGGRTL 414 

CPMO 358 :NNVTLVDVNETPVLRITEKGIVT-AEGEAEFDLIVFATGFDAVTGGITSIDFRNNQGQSF 416 

.****. *...**. ..*... ******. * ... * . . 

CHMO 401 :KDYWKEGPSSYMGVTVNNYPNMFMVLGPNGP — FTNLPPSIESQVEWISDTIQYTVENNV 458 

STMO 415 :KETWAAGPRTYLGLGIDGFPNFFNLTGPGSPSVLANMVLHSELHVDWVADAIAYLDARGA 474 

CPMO 4 17 : KDVWSDGIRTQLGVATAGFPNLLFGYGPQSPAGFCNGPSSAEYQGDLLIQLMNYLRDNNI 476 

*** * ** **** * * 

CHMO 4 5 9 : ES I EATKEAEEQWTQTCANI AEMTLFPKAQS WI FGANI PGKKNTVY FYLGGLKEYRTCAS 518 
STMO 475 : AGIEGTPEAVADWVEECRNRAEASLLNSANSWYLGANIPGRPRVFMPFLGGFGVYREIIT 534 

CPMO 477 : SRIEAQSEAQEEWSKLIADFWDSSLFPRAKSWYQGSNIPGKKVESLNFPLGLPTYISKFN 536 

***** ***** **** * * 

• •••••• • ,« •• 

CHMO 519-.NCKNHAYEGFDIQLQRSDIKQPANA 543 SEQ ID NO:7 
STMO 535 :EVAESGYKGFAILEG 549 SEQ ID NO: 6 

CPMO 537 :ESAEKGYAGFSLAS 550 SEQ ID NO: 5 

* ** 

• • • 
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10 20 30 40 50 60 

I I I I I I 

1 GCATGCGATAGCGGCACGTTGGTGGGCGGGCGAGCGCGGCCACGAGCGCGTCTCTCGCTC 

CGTACGCTATCGCCGTGCAACCACCCGCCCGCTCGCGCCGGTGCTCGCGCAGAGAGCGAG 
61 CACCGACATCACATCGCCGGTCACCGGCGAGAACATTGGCAAGATCCTGCGCTTTTCGCC 

GTGGCTGTAGTGTAGCGGCCAGTGGCCGCTCTTGTAACCGTTCTAGGACGCGAAAAGCGG 
121 GCTGAAGCCGCAGGGCTACGAGAGAATTTTTCAGTCCGGCCTGGCGGACTCGACCCAGCA 

CGACTTCGGCGTCCCGATGCTCTCTTAAAAAGTCAGGCCGGACCGCCTGAGCTGGGTCGT 
1 8 1 CACCCCCAGGATGGTGCGTTTCGGACAGTTCCTGTCCTGTGATCCAGAGACCATCAACAC 

GTGGGGGTCCTACCACGCAAAGCCTGTCAAGGACAGGACACTAGGTCTCTGGTAGTTGTG 

2 4 1 CCTGGAGACATTGGAGCGGATCGCGCGCGCCGACGTCAATGTCTTGCTGCATGGAGAGAC 

GGACCTCTGTAACCTCGCCTAGCGCGCGCGGCTGCAGTTACAGAACGACGTACCTCTCTG 

3 0 1 GGGC ACGGGC AAAGAGCTGAT AGCTCGCCACATTCATGTCGCGAGCCGCCGGCGGGATGC 

CCCGTGCCCGTTTCTCGACTATCGAGCGGTGTAAGTACAGCGCTCGGCGGCCGCCCTACG 
361 GCCCTACCTCGCCATCAACTGCGGGGCMTCAGCTCGGAGTTGCTGGAGAGTACTTTTTT 
CGGGATGGAGCGGTAGTTGACGCCCCGTTAGTCGAGCCTCAACGACCTCTCATGAAAAAA 

4 2 1 TGGCTATGTGCGCGGAGCATTCTCCGGGGCAGATCCCAAGGGCCGCGCCGGCTACTTTGA 

ACCGATACACGCGCCTCGTAAGAGGCCCCGTCTAGGGTTCCCGGCGCGGCCGATGAMCT 
4 8 1 ATCGGTGGGCGAAGGCACCTTGTTTCTGGACGAGATCGGCGAGCTGCCCTTGGCCATGCA 

TAGCCACCCGCTTCCGTGGAACAAAGACCTGCTCTAGCCGCTCGACGGGAACCGGTACGT 
541 GGCGGCATTGTTGCGCGTGCTGGAAGACGGAAGCTTTCTGCGTGTGGGCTCATCGACCCC 

CCGCCGTAACAACGCGCACGACCTTCTGCCTTCGAAAGACGCACACCCGAGTAGCTGGGG 
601 GCAGCGCGCCTCGTGCCGCATTATCGCGGCCACGCACCGCAATCTGGAGGAACTCATCGC 

CGTCGCGCGGAGCACGGCGTAATAGCGCCGGTGCGTGGCGTTAGACCTCCTTGAGTAGCG 
661 ACAGGGCCTGTTTCGCCAGGATCTCTACTACCGTCTCAAGATCGTTCAGAAAAGGCTCAA 

TGTCCCGGACAAAGCGGTCCTAGAGATGATGGCAGAGTTCTAGCAAGTCTTTTCCGAGTT 
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721 GCCGATACGGGAACGCACCTGTGACATCGCCTTGCTGGCAGAGCAATTCAAATCTGCACT 
CGGCTATGCCCTTGCGTGGACACTGTAGCGGAACGACCGTCTCGTTAAGTTTAGACGTGA 

7 8 1 GGCGCAAAAGCACCAGATCCCGAACGTGCAGATCCACCCCGAGGCCATGGCAGTCATGG A 

CCGCGTTTTCGTGGTCTAGGGCTTGCACGTCTAGGTGGGGCTCCGGTACCGTCAGTACCT 

8 4 1 GCGCTACCAGTGGCCGGGCAATGCGCGCGAAATCCGCAATGTGATGGAAGCCGCCCTGAT 

CGCGATGGTCACCGGCCCGTTACGCGCGCTTTAGGCGTTACACTACCTTCGGCGGGACTA 
901 CTGCTCCGATGGTGAAATCACGCTCGCCAGCCTGCCCCCGGAAGTATCTGAGAACTCCAC 

GACGAGGCTACCACTTTAGTGCGAGCGGTCGGACGGGGGCCTTCATAGACTCTTGAGGTG 
961 CTATCCTCTGCAMGCCGGGTGGCCGAAGCGAATAGCGAGATGCCTCCCGTCTCCGGCAA 

GATAGGAGACGTTTCGGCCCACCGGCTTCGCTTATCGCTCTACGGAGGGCAGAGGCCGTT 
1021 CGACTATGAACGGCAACTCATCGTGGGTTTGCTGCGCAAGTACCGAAAGGTCAATCATGT 

GCTGATACTTGCCGTTGAGTAGCACCCAAACGACGCGTTCATGGCTTTCCAGTTAGTACA 
1081 GGCGAGGGCCCTGGGCCTCGCGCGCTCCACGCTCTATCGGAAATTCGCAGACTTGGGCAT 

CCGCTCCCGGGACCCGGAGCGCGCGAGGTGCGAGATAGCCTTTAAGCGTCTGAACCCGTA 
1141 TGACCAACGCGAGTATGTCAATGATCTGTCCGATTAACACTGCTGCGCCCTGCGCTTGCG 

ACTGGTTGCGCTCATACAGTTACTAGACAGGCTAATTGTGACGACGCGGGACGCGAACGC 
1201 GCAACGCAGTATCGTCGGAACGCGGGC AACGCTAAAGGCAGCCGCCGTCGGCATCGGTTG 

CGTTGCGTCATAGCAGCCTTGCGCCCGTTGCGATTTCCGTCGGCGGCAGCCGTAGCCAAC 
1261 CCGCCCGGTGTTGTTTGTCGGCTGAGACCGCCTGCAGCACCAGCCCCTCCATTCCTCGTG 

GGCGGGCCACAACAAACAGCCGACTCTGGCGGACGTCGTGGTCGGGGAGGTMGGAGCAC 
1321 TCTAGATCTCCTTGCAACGCTGTCACAGCGTGCTTTTTTCGTGGCGCGGGACTTCTCGCG 

AGATCTAGAGGAACGTTGCGACAGTGTCGCACGAAAAAAGCACCGCGCCCTGAAGAGCGC 
1381 ATGAGGGCCTGCAAGGGCGGCGCATCAAAACAAGCGCCCTGTCCTCAGGACGTTTGCGCC 

TACTCCCGGACGTTCCCGCCGCGTAGTTTTGTTCGCGGGACAGGAGTCCTGCAAACGCGG 
1441 GTCGCTGGCTATGGGCAGTCTGCATGATCCGCTTGGGCTTGCCCACCATGCGCGCGACTG 

1-t=t- SB 
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CAGCGACCGATACCCGTCAGACGTACTAGGCGAACCCGAACGGGTGGTACGCGCGCTGAC 
1501 CGCTCTTTCCTGTTGGGCGCATCGTCATCCCGACCTTCCTGAAATCCATTTCCACAGGGC 

GCGAGAAAGGACAACCCGCGTAGCAGTAGGGCTGGAAGGACTTTAGGTAAAGGTGTCCCG 
1561 CGGCCTGAGCCAGACAGCGATCGGCAGGCTCTCGCATCTCGGACAGTGGTGCGCTACACC 

GCCGGACTCGGTCTGTCGCTAGCCGTCCGAGAGCGTAGAGCCTGTCACCACGCGATGTGG 
1621 ACAGGACACAAGTGTCCGGCGCGTGGCACGCAGCGGCCTTTGCTGCGCACGATGCGTGCT 

TGTCCTGTGTTCACAGGCCGCGCACCGTGCGTCGCCGGAAACGACGCGTGCTACGCACGA 
1681 TTTGGCACATGGAAACCCTCGGTTTTTCCGAACCAGAGACCTGGCACAAGTCTTGATAAC 

AAACCGTGTACCTTTGGGAGCCAAAAAGGCTTGGTCTCTGGACCGTGTTCAGAACTATTG 
1741 TCCTGAGTGACTGATGCAACGCATCTCAGTCCGGGAGCGTGGACGACACGCCCCTGAACT 

AGGACTCACTGACTACGTTGCGTAGAGTCAGGCCCTCGCACCTGCTGTGCGGGGACTTGA 
1801 TCAATTATTT AGGAG ACCCATATGACC ACCATGACCACCATGACC ACCGAACAACTCGGC 

AGTTAATAAATCCTCTGGGTATACTGGTGGTACTGGTGGTACTGGTGGCTTGTTGAGCCG 
1861 ATGAACAACTCTGTCAATGACAAGCTTGACGTTTTGCTCATCGGCGCCGGCTTCACCGGT 

TACTTGTTGAGACAGTTACTGTTCGAACTGCAAAACGAGTAGCCGCGGCCGAAGTGGCCA 
1921 CTCTACCAGCTCT ATCACCTGCGC AAGCTGGGCTAC AAGGTTC ATCTCGTCGACGCCGGT 

GAGATGGTCGAGATAGTGGACGCGTTCGACCCGATGTTCCAAGTAGAGCAGCTGCGGCCA 
1981 GCCGATATTGGCGGGATCTGGCATTGGAACTGCTACCCCGGAGCGCGTGTGGAT ACCCAC 

CGGCTATAACCGCCCTAGACCGTAACCTTGACGATGGGGCCTCGCGCACACCTATGGGTG 
2041 TGCCAGATCTACCAGTACTCCATTCCAGAGTTGTGGCAGGAGTTCAACTGGAAAGAGCTG 

ACGGTCTAGATGGTCATGAGGTAAGGTCTCMCACCGTCCTCAAGTTGACCTTTCTCGAC 
2101 TTCCCT AACTGGGCGCAAATGCGCGAGTATTTCCATTTTGCCGACAAGAAGCTCGACCTG 

AAGGGATTGACCCGCGTTTACGCGCTCATAAAGGTAAAACGGCTGTTCTTCGAGCTGGAC 
2161 AGCAAGGACATCAGCTTCAACACCCGTGTGCAGTCGGCCGTCTTTGACGAAGGCACACGC 

TCGTTCCTGTAGTCGAAGTTGTGGGCACACGTCAGCCGGCAGAAACTGCTTCCGTGTGCG 

ne=r 5H 
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2221 GAATGGACGGTACGCTCGATCGGACACCAGCCGATCCAGGCCAGGTTCGTCATCGCCMC 

CTTACCTGCCATGCGAGCTAGCCTGTGGTCGGCTAGGTCCGGTCCAAGCAGTAGCGGTTG 
2281 CTTGGCTTCGGTGCCAGCCCCAGCACGCCCAATGTCGATGGCATCGAGACATTCAAAGGC 

GAACCGAAGCCACGGTCGGGGTCGTGCGGGTTACAGCTACCGTAGCTCTGTAAGTTTCCG 
2341 CAGTGGTATCACACTGCTCTGTGGCCCCAGGAAGGCGTGAAC ATGGCCGGC AAGCGCGTG 

GTCACCATAGTGTGACGAGACACCGGGGTCCTTCCGCACTTGTACCGGCCGTTCGCGCAC 
2401 GCCATCATTGGCACCGGCTCCAGCGGGGTCCAGGTCGCCCAGGAGGCTGCCCTTGATGCG 

CGGTAGTAACCGTGGCCGAGGTCGCCCCAGGTCCAGCGGGTCCTCCGACGGGAACTACGC 
2461 AAACAGGTGACGGTGTACCAGCGCACCCCCAACCTGGCCTTGCCCATGCATCAGAAGCAG 

TTTGTCCACTGCCACATGGTCGCGTGGGGGTTGGACCGGAACGGGTACGTAGTCTTCGTC 
2521 CTCAGCGCCGAGGACAATCTGCGCATGAAGCCCGAGCTTCCCGCAGCGTTCGAGAGACGC 

GAGTCGCGGCTCCTGTTAGACGCGTACTTCGGGCTCGAAGGGCGTCGCAAGCTCTCTGCG 
2581 GGCAAGTGCTTCGCCGGCTTCGACTTCGACTTCATCGCCAAGAACGCGACCGAGCTGTCC 

CCGTTCACGAAGCGGCCGAAGCTGAAGCTGAAGTAGCGGTTCTTGCGCTGGCTCGACAGG 
2641 GCTGCGGAGCGCACAGAGATCTTGGAAGAGCTGTGGAACGCCGGCGGCTTCCGCTACTGG 

CGACGCCTCGCGTGTCTCTAGAACCTTCTCGACACCTTGCGGCCGCCGAAGGCGATGACC 
2701 CTGGCCAATTTCCAAGACTATCTGTTCGATGACAAGGCCAACGATTACGTCTACGAGTTC 

GACCGGTTAAAGGTTCTGATAGACAAGCTACTGTTCCGGTTGCTAATGCAGATGCTCAAG 
2761 TGGCGCGACAAGGTCCGCGCCCGCATCAAGGATCCGAAAGTTGCCGAGAAGCTCGCCCCC 

ACCGCGCTGTTCCAGGCGCGGGCGTAGTTCCTAGGCTTTCAACGGCTCTTCGAGCGGGGG 
2821 ATGAAGAAGCCGCATCCTTACGGAGCCAAGCGCCCTTCGCTGGAGCAGTGGTACTACGAG 

TACTTCTTCGGCGTAGGAATGCCTCGGTTCGCGGGAAGCGACCTCGTCACCATGATGCTC 
2881 ATCTTC AATC AGAACAACGTCACGCTGGTGGATGTCAACGAAACACCGGTGCTTCGCATC 

TAGAAGTTAGTCTTGTTGCAGTGCGACCACCTACAGTTGCTTTGTGGCCACGAAGCGTAG 
2941 ACCGAGAAAGGCATCGTGACCGCTGAGGGTGAAGCCGAATTCGACCTGATCGTGTTCGCG 

t~ xl 1 ".'nil 
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TGGCTCTTTCCGTAGCACTGGCGACTCCCACTTCGGCTTAAGCTGGACTAGCACAAGCGC 
3001 ACCGGCTTCGACGCAGTGACCGGGGGACTCACCAGCATCGACTTCCGCAACAACCAGGGC 

TGGCCGMGCTGCGTCACTGGCCCCCTGAGTGGTCGTAGCTGAAGGCGTTGTTGGTCCCG 
3061 CAGAGCTTCAAGGATGTGTGGTCTGACGGAATCCGCACCCAGCTGGGAGTGGCCACGGCA 

GTCTCGAAGTTCCTACACACCAGACTGCCTTAGGCGTGGGTCGACCCTCACCGGTGCCGT 
3121 GGTTTTCCCAACTTGCTCTTTGGCTACGGACCTCAATCGCCTGCGGGCTTCTGCAACGGT 

CCAAAAGGGTTGAACGAGAAACCGATGCCTGGAGTTAGCGGACGCCCGAAGACGTTGCCA 
3181 CCGTCGAGCGCCGAATACCAGGGCGATCTGCTGATCCAGCTGATGAACTACCTACGCGAC 

GGCAGCTCGCGGCTTATGGTCCCGCTAGACGACTAGGTCGACTACTTGATGGATGCGCTG 
3241 AACAACATCTCGCGCATCGAAGCCCAGTCCGAGGCACAGGAAGAATGGAGCAAGCTGATC 

TTGTTGTAGAGCGCGTAGCTTCGGGTCAGGCTCCGTGTCCTTCTTACCTCGTTCGACTAG 
330 1 GCAGACTTCTGGGACAGCTCGCTGTTCCCCCGCGCAAAGTCCTGGTACCAAGGATCCAAC 

CGTCTGAAGACCCTGTCGAGCGACAAGGGGGCGCGTTTCAGGACCATGGTTCCTAGGTTG 
3361 ATCCCGGGCAAGAAAGTCGAGAGCCTGAACTTCCCGCTGGGGCTGCCAACCTATATATCC 

TAGGGCCCGTTCTTTCAGCTCTCGGACTTGAAGGGCGACCCCGACGGTTGGATATATAGG 
3421 AAATTCAATGAAAGCGCTGAAAAAGGATATGCAGGCTTCTCGCTGGCCAGCTAAGACTCT 

TTTAAGTTACTTTCGCGACTTTTTCCTATACGTCCGAAGAGCGACCGGTCGATTCTGAGA 
3481 GTTGTGCAACTCCTGGAGACAAGCATATGGGACGTGTAAATGACAAAGTGGTTCTCGTCA 

CAACACGTTGAGGACCTCTGTTCGTATACCCTGCACATTTACTGTTTCACCAAGAGCAGT 
3541 CTGGCGGTGCCATGGGAATGGGCCTGACCCATTGCACCTTGCTGGCCCGCGAGGGGGCCA 

GACCGCCACGGTACCCTTACCCGGACTGGGTAACGTGGAACGACCGGGCGCTCCCCCGGT 
3601 CCGTTTACCTCAGCGACATGAATGAGGAACTGGGCCACCAAGCCGTAGCGGAGATCCGTC 

GGCAAATGGAGTCGCTGTACTTACTCCTTGACCCGGTGGTTCGGCATCGCCTCTAGGCAG 
3661 GGCAAGGTGGCAAGGCCCACTTCCTGCACCTGGACGTGACCAACGAGAATCACTGGACAG 

CCGTTCCACCGTTCCGGGTGAAGGACGTGGACCTGCACTGGTTGCTCTTAGTGACCTGTC 

— j Z5E 
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3721 GTGCCGTGGACACCATCCTCGCCGAGTCCGACCGGCTGGACGCGCTGGTCAACAACGCCG 

CACGGCACCTGTGGTAGGAGCGGCTCAGGCTGGCCGACCTGCGCGACCAGTTGTTGCGGC 
3781 GCATTTTGACCCTCAAACCCGTTCAGGACACCAGCAACGAAGAATGGGATCGCATCTTTG 

CGTAAAACTGGGAGTTTGGGCAAGTCCTGTGGTCGTTGCTTCTTACCCTAGCGTAGAAAC 
3841 AAATCAATGTCCGCAGTGTGTTCTTGGGCACGCGAGCAGTGATCGAGCCCATGCGTAAAG 

TTTAGTTACAGGCGTCACACAAGAACCCGTGCGCTCGTCACTAGCTCGGGTACGCATTTC 
3901 CCCATAAAGGCTGCATCGTCAATGTGTCGTCCATCTACGGTCTGGTGGGTGCGCCGGGAG 

GGGTATTTCCGACGTAGCAGTTACACAGCAGGTAGATGCCAGACCACCCACGCGGCCCTC 
3961 CTGCGGCCTACGAGGCCTCCAAAGGTGCCGTGCGCTTGTTTACCAAGGCTTGCGCGGTGG 

GACGCCGGATGCTCCGGAGGTTTCCACGGCACGCGAACAAATGGTTCCGAACGCGCCACC 
4021 ACCTGGCACCGTTCAATATTCGGGTCAACTCGGTCCACCCAGGGGTGATCGCAACACCCA 

TGGACCGTGGCAAGTTATAAGCCCAGTTGAGCCAGGTGGGTCCCCACTAGCGTTGTGGGT 
4081 TGACGCAGCAGATCCTGGACGCCCCACAGAGCGCACGTGCGCTTCTGGGCCCCACGCTGC 

ACTGCGTCGTCTAGGACCTGCGGGGTGTCTCGCGTGCACGCGAAGACCCGGGGTGCGACG 
4141 TGGGTCGTGCGGCCCAGCCGATGGAGGTCTCCCAGGCGGTGCTTTTCCTGGTCTCGGACG 

ACCCAGCACGCCGGGTCGGCTACCTCCAGAGGGTCCGCCACGAAAAGGACCAGAGCCTGC 
4201 AGGCCTCGTTCGTGCATGGCTCTGAACTCGTCGTCGACGGAGGGTATACGGCGAACTGAG 

TCCGGAGCAAGCACGTACCGAGACTTGAGCAGCAGCTGCCTCCCATATGCCGCTTGACTC 
4261 AAAGGAAC ATCCT CGGC ATGC 

TTTCCTTGTAGGAGCCGTACG 

■t-xi — / ~Z5F 
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SEQUENCE LISTING 

<110> NATIONAL RESEARCH COUNCIL OF CANADA 
IWAKI, Hiroaki 
HASEGAWA, Yoshie 
LAU, Peter C.K. 

<120> Cloning, Sequencing and Expression of a 

Comamonas Cyclopentanone 1, 2 -Monooxygenase -Encoding Gene in 
Escherichia coli 



<130> 2139-16PCT 

<150> US 60/218, 842 
<151> 2000-07-18 

<160> 8 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primers for Comamonas sp. 

<221> modif ied_base 
<222> (3) . . . (3) 
<223> I 

<221> modif ied_base 
<222> (6) . . . (6) 
<223> I 

<221> modif ied_base 
<222> (12) . . . (12) 
<223> I 

<221> unsure 
<222> (15) . . . (15) 
<223> N = any base 

<400> 1 

acnacnatga cnacnatgac 

<210> 2 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primers for Comamonas sp . 

<221> modif ied_base 
<222> (9) . . . (9) 
<223> I 
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<400> 2 

arrtgrtana rytgrta 

<210> 3 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primers for Comamonas sp. 
<400> 3 

aaaaggcctg aacttcaatt atttaggaga c 

<210> 4 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primers for Comamonas sp. 
<400> 4 

aaaactgcag gagttgcaca acagagtctt ag 

<210> 5 
<211> 550 
<212> PRT 

<213> Comamonas sp 



<400> 5 



Met 


Thr 


Thr Met Thr 


Thr Met 


Thr 


Thr 


Glu Gin Leu Gly Met Asn Asn 


1 




5 








10 15 


Ser 


Val 


Asn Asp Lys 


Leu Asp 


Val 


Leu 


Leu He Gly Ala Gly Phe Thr 






20 






25 


30 


Gly 


Leu 


Tyr Gin Leu 


Tyr His 


Leu 


Arg 


Lys Leu Gly Tyr Lys Val His 




35 




40 




45 


Leu 


Val 


Asp Ala Gly 


Ala Asp 


He 


Gly 


Gly He Trp His Trp Asn Cys 




50 




55 






60 


Tyr 


Pro 


Gly Ala Arg 


Val Asp 


Thr 


His 


Cys Gin He Tyr Gin Tyr Ser 


65 




70 






75 80" 


He 


Pro 


Glu Leu Trp 


Gin Glu 


Phe 


Asn 


Trp Lys Glu Leu Phe Pro Asn 






85 








90 95 


Trp 


Ala 


Gin Met Arg 


Glu Tyr 


Phe 


His 


Phe Ala Asp Lys Lys Leu Asp 




100 






105 


110 


Leu 


Ser 


Lys Asp He 


Ser Phe 


Asn 


Thr 


Arg Val Gin Ser Ala Val Phe 






115 




120 




125 


Asp 


Glu 


Gly Thr Arg 


Glu Trp 


Thr 


Val 


Arg Ser He Gly His Gin Pro 




130 




135 






140. 


He 


Gin 


Ala Arg Phe 


Val He 


Ala 


Asn 


Leu Gly Phe Gly Ala Ser Pro 


145 




150 






155 160 


Ser 


Thr 


Pro Asn Val 


Asp Gly 


He 


Glu 


Thr Phe Lys Gly Gin Trp Tyr 






165 








170 175 


His 


Thr 


Ala Leu Trp 


Pro Gin 


Glu 


Gly 


Val Asn Met Ala Gly Lys Arg 






180 






185 


190 


Val 


Ala 


He He Gly 


Thr Gly 


Ser 


Ser 


Gly Val Gin Val Ala Gin Glu 






195 




200 




205 


Ala 


Ala 


Leu Asp Ala 


Lys Gin 


Val 


Thr 


Val Tyr Gin Arg Thr Pro Asn 




210 


215 






220 
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Leu 


Ala 


Leu 


Pro 


Met 


His 


Gin 


Lys 


225 










230 






Arg 


Met 


Lys 


Pro 


Glu 


Leu 


Pro 


Ala 










245 








Phe 


Ala 


Gly 


Phe Asp 


Phe 


Asp 


Phe 








260 










Ser 


Ala 


Ala 


Glu Arg Thr 


Glu 


lie 






275 










280 


Gly 


Phe 


Arg 


Tyr Trp 


Leu 


Ala 


Asn 




290 










295 




Lys 


Ala 


Asn 


Asp Tyr Val 


Tyr 


Glu 


305 










310 






Arg 


He 


Lys 


Asp 


Pro 


Lys 


Val 


Ala 










325 








Pro 


His 


Pro 


Tyr Gly Ala 


Lys 


Arg 








340 










Glu 


He 


Phe 


Asn 


Gin 


Asn 


Asn 


Val 






355 










360 


Pro 


Val 


Leu 


Arg 


He 


Thr 


Glu 


Lys 




370 


• 








375 




Ala 


Glu 


Phe 


Asp 


Leu 


He 


Val 


Phe 


385 










390 






Gly 


Gly 


Leu 


Thr 


Ser 


He 


Asp 


Phe 










405 








Lys 


Asp 


Val 


Trp 


Ser Asp 


Gly 


He 








420 










Ala 


Gly 


Phe 


Pro 


Asn 


Leu 


Leu 


Phe 






435 










440 


Gly 


Phe 


Cys 


Asn Gly 


Pro 


Ser 


Ser 




450 




• 






455 




He 


Gin 


Leu 


Met 


Asn 


Tyr 


Leu 


Arg 


465 










470 






Ala 


Gin 


Ser 


Glu 


Ala 


Gin 


Glu 


Glu 










485 








Trp 


Asp 


Ser 


Ser 


Leu 


Phe 


Pro 


Arg 








500 










Asn 


He 


Pro 


Gly Lys 


Lys 


Val 


Glu 






515 










520 


Pro 


Thr 


Tyr 


He 


Ser 


Lys 


Phe 


Asn 




530 










535 




Gly 


Phe 


Ser 


Leu 


Ala 


Ser 






545 










550 







Gin 


Leu 


Ser Ala Glu Asp Asn 


Leu 






235 


240 


Ala 


Phe 


Glu Arg Arg Gly Lys 


Cys 




250 


255 




He 


Ala 


Lys Asn Ala Thr Glu 


Leu 


265 




270 




Leu 


Glu 


Glu Leu Trp Asn Ala 


Gly 






285 




Phe 


Gin 


Asp Tyr Leu Phe Asp 


Asp 






300 




Phe 


Trp 


Arg Asp Lys Val Arg 


Ala 






315 


320 


Glu 


Lys 


Leu Ala Pro Met Lys 


Lys 




330 


335 




Pro 


Ser 


Leu Glu Gin Trp Tyr 


Tyr 


345 




350 




Thr 


Leu 


Val Asp Val Asn Glu 


Thr 






365 




Gly 


He 


Val Thr Ala Glu Gly 


Glu 






380 




Ala 


Thr 


Gly Phe Asp Ala Val 


Thr 






395 


400 


Arg 


Asn 


Asn Gin Gly Gin Ser 


Phe 




410 


415 




Arg 


Thr 


Gin Leu Gly Val Ala 


Thr 


425 




430 




Gly Tyr 


Gly Pro Gin Ser Pro 


Ala 






445 




Ala 


Glu 


Tyr Gin Gly Asp Leu 


Leu 






460 




Asp 


Asn 


Asn He Ser Arg He 


Glu 






475 


480 


Trp 


Ser 


Lys Leu He Ala Asp 


Phe 




490 


* 495 




Ala 


Lys 


Ser Trp Tyr Gin Gly 


Ser 


505 




510 




Ser 


Leu 


Asn Phe Pro Leu Gly 


Leu 






525 




Glu 


Ser 


Ala Glu Lys Gly Tyr 


Ala 



540 



.<210> 6 
<211> 549 
<212> PRT 

<213> Rhodococcus rhodochrous 



<400> 6 



Met Asn Gly Gin His 


Pro Arg 


Ser 


Val 


Val 


Thr Ala Pro Asp Ala 


Thr 


1 5 








10 


15 




Thr Gly Thr Thr Ser 


Tyr Asp 


Val 


Val 


Val 


Val Gly Ala Gly lie 


Ala 


20 






25 




30 




Gly Leu Tyr Ala lie 


His Arg 


Phe 


Arg 


Ser 


Gin Gly Leu Thr Val 


Arg 


35 




40 






45 




Ala Phe Glu Ala Ala 


Ser Gly Val Gly Gly 


Val Trp Tyr Trp Asn 


Arg 


50 


55 








60 


* 


Tyr Pro Gly Ala Arg Cys Asp 


Val 


Glu 


Ser 


He Asp Tyr Ser Tyr 


Ser 


65 


70 








75 


80 



I 
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Phe Ser Pro Glu 

Thr Gin Pro Glu 

100 

Asp Leu Arg Arg 
115 

Leu Asp Glu Glu 
130 

Glu Val Ser Ala 
145 

Ala Asn Thr Pro 

Val His Thr Ala 

180 

Arg Val Gly Val 
195 

lie He Ala Glu 
210 

Asn Tyr Ser He 
225 

Ala Glu Gin Lys 

Ser Gly Gly Gly 

260 

Val Ser Glu Glu 
275 

Gly Gly Val Leu 
290 

Ala Ala Asn Asp 
305 

Val Val Asp Asp 

Ala He Gly Ala 

340 

Tyr Asn Arg Asp 
355 

Val Gly Met Asp 
370 

Leu Asp Met He 
385 

Leu Asp Lys Leu 

Thr Trp Ala Ala 

420 

Phe Pro Asn Phe 
435 

Ala Asn Met Val 
450 

Ala He Ala Tyr 
465 

Pro Glu Ala Val 

Ala Ser Leu Leu 

500 

Pro Gly Arg Pro 
515 

Tyr Arg Glu He 
530 

Ala He Leu Glu 
545 



Leu 


Glu 


Gin 


Glu 


85 








He 


Leu 


Ala 


Tyr 


Asp 


He 


Arg 


Phe 








120 


Gly 


Leu Arg 


Trp 






135 




Arg 


Phe 


Leu 


Val 




150 






Ala 


Phe Asp 


Gly 


165 








Arg 


Trp 


Pro 


His 


He 


Gly Thr Gly 








200 


Gin 


Ala 


Glu 


Gin 




• 


215 




Pro 


Ala 


Gly Asn 




230 






Ala 


Asn Tyr Ala 


245 








Ser 


Pro 


His 


Arg 


Glu 


Arg Arg 


Ala 








280 


Phe 


Ser 


Lys 


Ala 






295 




Thr 


Ala 


Arg 


Ala 




310 






Pro 


Ala 


Val 


Ala 


325 








Lys 


Arg 


He 


Val 


Asn 


Val 


Glu 


Leu 








360 


Glu 


Thr 


Gly 


He 






375 




Val 


Leu 


Ala 


Thr 




390 






Glu 


He Val Gly 


405 








Gly 

mm 

* 


Pro Arg Thr 


Phe 


Asn 


Leu 


Thr 








440 


Leu 


His 


Ser 


Glu 






455 




Leu 


Asp 


Ala 


Arg 




470 






Ala 


Asp 


Trp 


Val 


485 








Asn 


Ser 


Ala 


Asn 


Arg 


Val 


Phe 


Met 








520 


He 


Thr 


Glu 


Val 






535 




Gly 









Trp 


Asn 


Trp Ser 




90 




Leu 


Glu 


His Val 


105 






Asp 


Thr 


Arg Val 


Thr 


Val 


Arg Thr 






140 


Val 


Ala 


Ala Gly 






155 


Leu Asp 


Arg Phe 




170 




Asp 


Gly 


Val Asp 


185 






Ser 


Ser 


Gly He 


Leu 


Phe 


Val Phe 






220 


Val 


Pro 


Leu Asp 






235 


Glu 

V-J 


Ara 


Arg Arg 




250 




Pro 


His 


Pro Lys 


265 






Val 


Tyr 


Glu Glu 


Phe 


Pro 


Asp Gin 






300 


Phe 


Trp 


Glu Glu 






315 


Glu 


Leu 


Leu Thr 




330 




Thr 


Asp 


Ser Gly 

mm 


345 






Val 

4 


Asp 


Leu Arg 


Val 


Thr 


Thr Gly 






380 


Gly Phe 


Asp Ala 






395 


Arg Gly 


Gly Arg 




410 




Tyr 


Leu 


Gly Leu 


425 






Gly 


Pro 


Gly Ser 

mm 


Leu 


His 


Val Asp 






460 


Gly Ala 


Ala Gly 






475 


Glu 


Glu 


Cys Arg 




490 




Ser 


Trp 


Tyr Leu 


505 






Pro 


Phe 


Leu Gly 


Ala 


Glu 


Ser Gly 






540 



Glu 


Lys 


Tyr Ala 






QC 


Ala 


Asp 


Arg Phe 




110 




Thr 


Ser 


Ala Val 


125 






Asp 


Arg 


Gly Asp 


Pro 


Leu 


Ser Asn 






160 


Thr 


Gly 


Asp He 






175 


Phe 


Thr 


Gly Lys 




190 




Gin 


Ser 


He Pro 


205 






Gin 


Arg 


Ser Ala 


Asp 


Ala 


Thr Arg 






240 


Leu 


Ser 


Arg Glu 






255 


Ser 


Ala 


Leu Glu 




270 




Arg 


Trp 


Lys Leu 


285 






Leu 


Thr 


Asp Pro 


Lys 

mm 


He 


Arg Ala 






320 


Pro 


Lys 


Asp His 






335 


Tyr 


Tyr 


Glu Thr 




350 




Ser 


Thr 


Pro He 


365 






Ala 


His 


Tyr Asp 


Met 


Thr 


Gly Ser 






400 


Thr 


Leu 


Lys Glu 






415 


Gly 


He 


Asp Gly 




430 




Pro 


Ser 


Val Leu 


445 






Trp 

■ 


Val 


Ala Asp 


He 


Glu 


Gly Thr 






480 


Asn 


Arg 


Ala Glu 






495 


Gly 


Ala 


Asn He 




510 




Gly 


Phe 


Gly Val 


525 






Tyr 


Lys 


Gly Phe 
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<210> 7 
<211> 543 
<212> PRT 

<213> Acinetobacter sp 
<400> 7 



Met 


Ser 


Gin 


Lys 


Met 


Asp 


Phe Asp Ala 


He 


Val 


He Gly 


Gly Gly 


Phe 


1 








5 






10 








15 




Gly 


Gly 


Leu 


Tyr 


Ala 


Val 


Lys Lys Leu 


Arg Asp 


Glu 


Leu 


Glu Leu 


Lys 








20 






25 










30 




Val 


Gin 


Ala 


Phe 


Asp 


Lys 


Ala Thr Asp 


Val 


Ala 


Gly Thr 


Trp Tyr 


Trp 






35 








40 








45 






Asn 


Arg 
50 


Tyr 


Pro 


Gly 


Ala- 


Leu Thr Asp 
55 


Thr 


Glu 


Thr 
60 


His 


Leu Tyr 


Cys 


Tyr 


Ser 


Trp 


Asp 


Lys 


Glu 


Leu Leu Gin 


Ser 


Leu 


Glu 


He 


Lys Lys 


Lys 


65 










70 






75 








80 


Tyr 


Val 


Gin 


Gly 


Pro 


Asp Val Arg Lys 


Tyr 


Leu 


Gin 


Gin 


Val Ala 


Glu 










85 






90 








95 




• Lys 


His 


Asp 


Leu 
100 


Lys 


Lys 


Ser Tyr Gin 
105 


Phe 


Asn 


Thr 


Ala 


Val Gin 
110 


Ser 


Ala 


His 


Tyr 


Asn 


Glu 


Ala Asp Ala Leu 


Trp Glu 


Val 


Thr 


Thr Glu 


Tyr 






115 








120 








125 






Gly 


Asp 
130 


Lys 


Tyr 


Thr 


Ala 


Arg Phe Leu 
135 


He 


Thr 


Ala 
140 


Leu 


Gly Leu 


Leu 


Ser 


Ala 


Pro 


Asn 


Leu 


Pro 


Asn He Lys 


Gly 


He 


Asn 


Gin 


Phe Lys 


Gly 


145 










150 






155 








160 


Glu 


Leu 


His 


His 


Thr 


Ser Arg Trp Pro 


Asp Asp 


Val 


Ser 


Phe Glu 


Gly 










165 






170 








175 




Lys 


Arg 


Val 


Gly 


Val 


lie Gly Thr Gly 


Ser 


Thr 


Gly Val 


Gin Val 


He 








180 






185 










190 




Thr 


Ala 


Val 


Ala 


Pro 


Leu Ala Lys His 


Leu 


Thr 


Val 


Phe 


Gin Arg 


Ser 






195 








200 








205 






Ala 


Gin 
210 


Tyr 


Ser 


Val 


Pro 


He Gly Asn 
215 


Asp 


Pro 


Leu 
220 


Ser 


Glu Glu 


Asp 


Val 


Lys 


Lys 


lie 


Lys 


Asp Asn Tyr Asp 


Lys 


Ser 


Leu Gly 


Trp Cys 


Met 


225 










230 






235 








240 


Asn 


Ser 


Ala 


Leu 


Ala 
245 


Phe 


Ala Leu Asn 


Glu 
250 


Ser 


Thr 


Val 


Pro Ala 
255 


Met 


Ser 


Val 


Ser 


Ala 


Glu 


Glu Arg Lys Ala 


Val 


Phe 


Glu 


Lys 


Ala Trp 


Gin 








260 






. 265 










270 




Thr 


Gly 


Gly 


Gly 


Phe 


Arg' 


Phe Met Phe 


Glu 


Thr 


Phe Gly 


Asp He 


Ala 






275 








280 








285 






Thr 


Asn 
290 


Met 


Glu 


Ala 


Asn 


He Glu Ala 
295 


Gin 


Asn 


Phe 
300 


He 


Lys Gly 


Lys 


lie 


Ala 


Glu 


lie 


Val 


Lys 


Asp Pro Ala 


He 


Ala 


Gin 


Lys 


Leu Met 


Pro 


305 










310 






315 








320 


Gin 


Asp 


Leu 


Tyr 


Ala 


Lys 


Arg Pro Leu 


Cys Asp 


Ser Gly 


Tyr Tyr 


Asn 










325 






330 








335 




Thr 


Phe 


Asn 


Arg 


Asp 


Asn 


Val Arg Leu 


Glu Asp 


Val 


Lys 


Ala Asn 


Pro 








340 






345 










350 




lie 


Val 


Glu 
355 


lie 


Thr 


Glu 


Asn Gly Val 
360 


Lys 


Leu 


Glu 


Asn 
365 


Gly Asp 


Phe 


Val 


Glu 


Leu 


Asp 


Met 


Leu 


He Cys Ala 


Thr Gly 


Phe 


Asp 


Ala Val 


Asp 




370 










375 






380 








Gly 


Asn 


Tyr 


Val 


Arg 


Met 


Asp He Gin 


Gly Lys 


Asn Gly 


Leu Ala 


Met 


385 










390 






395 








400 


Lys 


Asp 


Tyr 


Trp 


Lys 
405 


Glu 


Gly Pro Ser 


Ser 
410 


Tyr 


Met 


Gly 


Val Thr 
415 


Val 
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Asn 


Asn 


Tyr Pro 


Asn 


Met 


Phe 


Met 


Val 


Leu Gly Pro Asn Gly Pro Phe 






420 










425 


430 


Thr 


Asn 


Leu Pro 


Pro 


Ser 


He 


Glu 


Ser 


Gin Val Glu Trp He Ser Asp 






435 








440 




445 


Thr 


lie 


Gin Tyr 


Thr 


Val 


Glu 


Asn 


Asn 


Val Glu Ser He Glu Ala Thr 




450 






455 






460 


Lys 


Glu 


Ala Glu 


Glu 


Gin 


Trp 


Thr 


Gin 


Thr Cys Ala Asn He Ala Glu 


.465 








470 








475 480 


Met 


Thr 


Leu Phe 


Pro 


Lys 


Ala 


Gin 


Ser 


Trp He Phe Gly Ala Asn He 








485 










490 495 






Lys Lys 


Asn 


Thr 


Val 

V CL J. 




Phe 

rue 


TVr Leu Glv Glv Leu Lys Glu 




500 










505 


510 


Tyr 


Arg 


Thr Cys 


Ala 


Ser 


Asn 


Cys 


Lys 


Asn His Ala Tyr Glu Gly Phe 






515 








520 




525 


Asp 


He 


Gin Leu Gin Arg 


Ser 


Asp 


He 


Lys Gin Pro Ala Asn Ala 




530 








535 






540 


<210> 8 

















<211> 4281 
<212> DNA 

<213> Comamonas sp 

<220> 
<221> CDS 

<222> (1822) . . . (3471) 
<400> 8 

geatgegata gcggcacgtt ggtgggcggg cgagcgcggc cacgagcgcg tctctcgctc 60 

caccgacatc acatcgccgg tcaccggcga gaacattggc aagatcctgc gcttttcgcc 12 0 

getgaagecg cagggctacg agagaatttt tcagtccggc ctggcggact cgacccagca 180 

cacccccagg atggtgcgtt teggacagtt cctgtcctgt gatccagaga ccatcaacac 240 

cctggagaca ttggagcgga tcgcgcgcgc egaegtcaat gtcttgctgc atggagagac 300 

gggcaeggge aaagagctga tagctcgcca cattcatgtc gcgagccgcc ggcgggatgc 360 

gccctacctc gccatcaact gcggggcaat cagcteggag ttgctggaga gtactttttt 42 0 

tggctatgtg cgeggagcat tctccggggc agatcccaag ggccgcgccg gctactttga 48 0 

atcggtgggc gaaggcacct tgtttctgga egagategge gagctgccct tggecatgea 54 0 

ggcggcattg ttgcgcgtgc tggaagaegg aagctttctg cgtgtgggct catcgacccc 600 

gcagcgcgcc tcgtgccgca ttatcgegge cacgcaccgc aatctggagg aactcatcgc 660 

acagggectg tttcgecagg atctctacta ccgtctcaag ategttcaga aaaggctcaa 72 0 

gecgataegg gaacgcacct gtgacatege ettgetggea gagcaattca aatctgeact 780 

ggcgcaaaag caccagatcc egaaegtgea gatccacccc gaggecatgg cagtcatgga 840 

gcgctaccag tggcegggea atgegegega aatccgcaat gtgatggaag ccgccctgat 900 

ctgctccgat ggtgaaatca cgctcgccag cctgcccccg gaagtatctg agaactccac 960 

ctatcctctg caaagceggg tggecgaage gaatagegag atgcctcccg tctccggcaa 102 0 

cgactatgaa cggcaactca tcgtgggttt getgegcaag taccgaaagg tcaatcatgt 1080 

ggegagggee ctgggcctcg cgcgctccac getctategg aaattcgcag acttgggcat 1140 

tgaccaacgc gagtatgtca atgatctgtc cgattaacac tgctgcgccc tgcgcttgcg 1200 

geaaegcagt ategteggaa cgcgggcaac gctaaaggca gccgccgtcg gcatcggttg 1260 

ccgcccggtg ttgtttgtcg gctgagaccg cctgcagcac cagcccctcc attcctegtg 132 0 

tctagatctc cttgcaacgc tgtcacagcg tgetttttte gtggcgcggg acttctcgcg 1380 

atgagggect geaagggegg cgcatcaaaa caagcgccct gtcctcagga cgtttgcgcc 1440 

gtcgctggct atgggcagtc tgeatgatec gcttgggctt gcccaccatg cgcgcgactg 1500 

cgctctttcc tgttgggcgc atcgtcatcc cgaccttcct gaaatccatt tccacagggc 1560 

cggcctgagc cagacagega teggcagget ctcgcatctc ggacagtggt gcgctacacc 1620 

acaggacaca agtgtccggc gcgtggcacg cagcggcctt tgctgcgcac gatgegtget 1680 

tttggcacat ggaaaccctc ggtttttccg aaccagagac ctggcacaag tcttgataac 1740 

tcctgagtga ctgatgeaac gcatctcagt ccgggagcgt ggacgacacg cccctgaact 1800 

tcaattattt aggagaccca t atg acc acc atg acc acc atg acc acc gaa 1851 

Met Thr Thr Met Thr Thr Met Thr Thr Glu 
1 5 10 
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caa etc ggc atg aac aac tct gtc aat gac aag ctt gac gtt ttg etc 1899 
Gin Leu Gly Met Asn Asn Ser Val Asn Asp Lys Leu Asp Val Leu Leu 

15 20 25 

ate ggc gee ggc ttc acc ggt etc tac cag etc tat cac ctg cgc aag 1947 
lie Gly Ala Gly Phe Thr Gly Leu Tyr Gin Leu Tyr His Leu Arg Lys 

30 35 40 

ctg ggc tac aag gtt cat etc gtc gac gee ggt gee gat att ggc ggg 1995 
Leu Gly Tyr Lys Val His Leu Val Asp Ala Gly Ala Asp lie Gly Gly 
45 50 55 

ate tgg cat tgg aac tgc tac ccc gga gcg cgt gtg gat acc cac tgc 2 043 

lie Trp His Trp Asn Cys Tyr Pro Gly Ala Arg Val Asp Thr His Cys 
60 65 70 



cag ate tac cag tac tec att cca gag ttg tgg cag gag ttc aac tgg 
Gin lie Tyr Gin Tyr Ser He Pro Glu Leu Trp Gin Glu Phe Asn Trp 
75 80 85 90 



2091 



aaa gag ctg ttc cct aac tgg gcg caa atg cgc gag tat ttc cat ttt 2139 
Lys Glu Leu Phe Pro Asn Trp Ala Gin Met Arg Glu Tyr Phe His Phe 

95 100 105 

gee gac aag aag etc gac ctg age aag gac ate age ttc aac acc cgt 2187 
Ala Asp Lys Lys Leu Asp Leu Ser Lys Asp He Ser Phe Asn Thr Arg 

110 115 120 

gtg cag teg gee gtc ttt gac gaa ggc aca cgc gaa tgg acg gta cgc 223 5 

Val Gin Ser Ala Val Phe Asp Glu Gly Thr Arg Glu Trp Thr Val Arg 
125 130 135 

teg ate gga cac cag ccg ate cag gee agg ttc gtc ate gee aac ctt 22 83 

Ser lie Gly His Gin Pro He Gin Ala Arg Phe Val He Ala Asn Leu 
140 145 150 

ggc ttc ggt gee age ccc age acg ccc aat gtc gat ggc ate gag aca 2331 
Gly Phe Gly Ala Ser Pro Ser Thr Pro Asn Val Asp Gly He Glu Thr 
155 160 165 170 

ttc aaa ggc cag tgg tat cac act get ctg tgg ccc cag gaa ggc gtg 2379 
Phe Lys Gly Gin Trp Tyr His Thr Ala Leu Trp Pro Gin Glu Gly Val 

175 180 185 

aac atg gee ggc aag cgc gtg gee ate att ggc acc ggc tec age ggg 2427 
Asn Met Ala Gly Lys Arg Val Ala He He Gly Thr Gly Ser Ser Gly 

190 195 200 

gtc cag gtc gee cag gag get gee ctt gat gcg aaa cag gtg acg gtg 2475 
Val Gin Val Ala Gin Glu Ala Ala Leu Asp Ala Lys Gin Val Thr Val 
205 210 215 

tac cag cgc acc ccc aac ctg gee ttg ccc atg cat cag aag cag etc 2523 
Tyr Gin Arg Thr Pro Asn Leu Ala Leu Pro Met His Gin Lys Gin Leu 
220 225 230 

age gee gag gac aat ctg cgc atg aag ccc gag ctt ccc gca gcg ttc 2571 
Ser Ala Glu Asp Asn Leu Arg Met Lys Pro Glu Leu Pro Ala Ala Phe 
235 240 245 250 
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gag aga cgc ggc aag tgc ttc gcc ggc ttc gac ttc gac ttc ate gec 2619 
Glu Arg Arg Gly Lys Cys Phe Ala Gly Phe Asp Phe Asp Phe lie Ala 

255 260 265 

aag aac gcg acc gag ctg tec get gcg gag cgc aca gag ate ttg gaa 2667 
Lys Asn Ala Thr Glu Leu.Ser Ala Ala Glu Arg Thr Glu lie Leu Glu 

270 275 280 

gag ctg tgg aac gcc ggc ggc ttc cgc tac tgg ctg gcc aat ttc caa 2715 
Glu Leu Trp Asn Ala Gly Gly Phe Arg Tyr Trp Leu Ala Asn Phe Gin 
285 290 295 

gac tat ctg ttc gat gac aag gcc aac gat tac gtc tac gag ttc tgg 2763 
Asp Tyr Leu Phe Asp Asp Lys Ala Asn Asp Tyr Val Tyr Glu Phe Trp 
300 305 310 

cgc gac aag gtc cgc gcc cgc ate aag gat ccg aaa gtt gcc gag aag 2811 
Arg Asp Lys Val Arg Ala Arg lie Lys Asp Pro Lys Val Ala Glu Lys 
315 320 325 330 

etc gcc ccc atgaag aag ccg cat cct tac gga gcc aag cgc cct teg 2859 
Leu Ala Pro Met Lys Lys Pro His Pro Tyr Gly Ala Lys Arg Pro Ser 

335 340 345 



ctg gag cag tgg tac tac gag ate ttc aat cag aac aac gtc acg ctg 
Leu Glu Gin Trp Tyr Tyr Glu lie Phe Asn Gin Asn Asn Val Thr Leu 

350 355 360 



gtg acc get gag ggt gaa gcc gaa ttc gac ctg ate gtg ttc gcg acc 
Val Thr Ala Glu Gly Glu Ala Glu Phe Asp Leu lie Val Phe Ala Thr 
380 385 390 



aac cag ggc cag age ttc aag gat gtg tgg tct gac gga ate cgc acc 
Asn Gin Gly Gin Ser Phe Lys Asp Val Trp Ser Asp Gly lie Arg Thr 

415 420 425 
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gtg gat gtc aac gaa aca ccg gtg ctt cgc ate acc gag aaa ggc ate 2955 
Val Asp Val Asn Glu Thr Pro Val Leu Arg He Thr Glu Lys ply He 
365 370 375 



3003 



ggc ttc gac gca gtg acc ggg gga etc acc age ate gac ttc cgc aac 3051 
Gly Phe Asp Ala Val Thr Gly Gly Leu Thr Ser He Asp Phe Arg Asn 
395 • ' 400 405 ' 410 



3099 



cag ctg gga gtg gcc acg gca ggt ttt ccc aac ttg etc ttt ggc tac 3147 
Gin Leu Gly Val Ala Thr Ala Gly Phe Pro Asn Leu Leu Phe Gly Tyr 

430 435 440 

gga cct caa teg cct gcg ggc ttc tgc aac ggt ccg teg age gcc gaa 3195 
Gly Pro Gin Ser Pro Ala Gly Phe Cys Asn Gly Pro Ser Ser Ala Glu 
445 450 455 

tac cag ggc gat ctg ctg ate cag ctg atg aac tac eta cgc gac aac 3243 
Tyr Gin Gly Asp Le'u Leu He Gin Leu Met Asn Tyr Leu Arg Asp Asn 
460 465 470 

aac ate teg cgc ate gaa gcc cag tec gag gca cag gaa gaa tgg age- 3291 
Asn He Ser Arg He Glu Ala Gin Ser Glu Ala Gin Glu Glu Trp Ser 
475 480 485 490 
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aag ctg ate gca gac ttc tgg gac age teg ctg ttc c.cc cgc gca aag 3339 
Lys Leu lie Ala Asp Phe Trp Asp Ser Ser Leu Phe Pro Arg Ala Lys 

495 500 505 

tec tgg tac caa gga tec aac ate ccg ggc aag aaa gtc gag age ctg 3387 
Ser Trp Tyr Gin Gly Ser Asia lie Pro Gly Lys Lys Val Glu Ser Leu 

510 * 515 520 

aac ttc ccg ctg ggg ctg cca ace tat ata tec aaa ttc aat gaa age 3435 
As n Phe Pro Leu Gly Leu Pro Thr Tyr lie Ser Lys Phe Asn Glu Ser 
525 530 535 

get gaa aaa gga tat gca ggc ttc teg ctg gee age taagactctg 3481 
Ala Glu Lys Gly Tyr Ala Gly Phe Ser Leu Ala Ser 
540 545 550 

ttgtgcaact cctggagaca agcatatggg acgtgtaaat gacaaagtgg ttctcgtcac 3541 

tggcggtgcc atgggaatgg gcctgaccca ttgcaccttg ctggcccgcg agggggecac 3601 

cgtttacctc agegacatga 'atgaggaact gggccaccaa geegtagegg agatcegteg 3661 

gcaaggtggc aaggcccact tcctgcacct ggacgtgacc aacgagaatc actggacagg 3721 

tgccgtggac accatcctcg ccgagtccga ccggctggac gcgctggtca acaacgccgg 3781 

cattttgacc ctcaaacccg ttcaggacac cagcaacgaa gaatgggatc gcatctttga 3 841 

aatcaatgtc cgcagtgtgt tcttgggcac gcgagcagtg atcgagccca tgegtaaage 3901 

ccataaaggc tgeategtea atgtgtcgtc catctaeggt ctggtgggtg cgccgggagc 3961 

tgcggcctac gaggcctcca aaggtgccgt gcgcttgttt accaaggctt gcgcggtgga 4021 

cctggcaccg ttcaatattc gggtcaactc ggtccaccca ggggtgatcg caacacccat 4081 

gaegcagcag atcctggacg ccccacagag cgcacgtgcg cttctgggcc ccacgctgct 4141 

gggtcgtgcg gcccagccga tggaggtctc ecaggeggtg cttttcctgg teteggaega 42 01 

ggcctcgttc gtgcatggct ctgaactcgt cgtcgacgga gggtataegg cgaactgaga 4261 

aaggaacatc ctcggcatgc 42 81 
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